ALGORITHM FOR GENERATING ORTHOGONAL 
MATRICES WITH RATIONAL ELEMENTS. 



R. A. Sharipov 



Abstract. Special orthogonal nxn matrices with rational elements form the group 
SO(n, Q), where Q is the field of rational numbers. A theorem describing the structure 
of an arbitrary matrix from this group is proved. This theorem yields an algorithm 
for generating such matrices by means of random number routines. 



1. Introduction. 

Orthogonal matrices from the orthogonal group 0(n,K) describe rotations and 
rotations with reflections in the n-dimensional Euclidean space R". Pure rotations 
constitute another classical group SO(ri,R), which is a subgroup in 0(n,R). The 
following matrix represents an elementary rotation in the p-q coordinate plane: 





■ q ■ 



(1.1) 



Here a — cos{lp), (3 — sin((^), and Lp is the rotation angle. Matrices of the form 
(1.1) can be treated as elements of S0(2,M) embedded into SO(?i,M). The matrix 



O* 











(1.2) 



with O* G SO(n - 1, R) then is an element of SO(n - 1, M) embedded into SO(n, 1 
For orthogonal matrices over the field of reals M there is the following theorem. 
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Theorem 1.1. Each orthogonal matrix O G SO(n, R) can be represented as a 
product O — 012] i^i) ■ ■ ■ ■■ 0[",T^](<Pn-i) ■ ^, where the first n — 1 terms are matrices 
of elementary rotations (1-1), and Q is an orthogonal matrix of the form (1.2). 

The angles tpi, ... , ipn-i in the theorem 1.1 are known as Euler angles (see 
Chapter Vll in [1] for Euler angles in the three-dimensional case 71 = 3). These 
angles are restricted by the inequalities 



< (^1 < 2n, 



^ ^ TT for i — 2, 



Applying the theorem 1.1 recursively to O, then to O* in (1.2), and so on, we easily 
prove the following theorem. 

Theorem 1.2. Each orthogonal matrix O G S0(7i, M) can be represented as a 
product of n{n — l)/2 matrices of elementary rotations (l-l). 

The theorem 1.1 is not valid for orthogonal matrices over the field of rational 
numbers. As for the theorem 1.2, 1 don't know if it is valid or not for O G SO(n, Q). 
However, there is an algorithm for constructing orthogonal matrices over the field 
of rational numbers. This algorithm is exhausting, i.e. each orthogonal matrix 
O G SO(n, Q) can be obtained by applying this algorithm. 



2. Stereographic projection. 

Let 6 be the unit sphere in M". We take the point S with the coordinates 
(0, . . . ,0,-1) for the south pole on this sphere. Then the equatorial hyperplane 
a is given by the equation .t" = 0, where x" is the n-th coordinate of a point of 

R" (we use upper indices for coordinates 
of vectors and points according to Ein- 
stein's tcnsorial notation, which is pop- 
ular in differential geometry and in gen- 
eral relativity). Let X be an arbitrary 
point of the unit sphere 6 and let [SX) 
be the ray starting at the south pole S and 
passing through the point X <E &. This 
ray crosses the equatorial hyperplane at 
some definite point Y (as shown on fig- 
ure 2.1 for three dimensional case). For 
each X point Y is unique. If we denote 
by 6° the unit sphere S with the pinned 
off south pole S, then we get the mapping 
f : 6° a that maps 6° onto equato- 
rial hyperplane a. This map is called the stereographic projection. It is bijective 
and smooth. It's very important for us that the stereographic projection is given 
by rational functions. Indeed, if x^, ... , x" are the coordinates of some point 
X G 6° and if y^, ... , y""^ are the coordinates of the point Y = f{X), then for 




Fig. 2.1 



y 



y" 



y 

we get the following formulas: 



1 + x^' 



for 5=1, 



, n — 1. 



(2.1) 
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The inverse mapping / ^ : Y ^ X is also given by rational functions. Indeed, for 
the n-th coordinate x" we have the formula 

^" = - E(2^')') / (l + E(2/^)') • (2.2) 
For the other coordinates of the point X from (2.1) and (2.2) we derive 



where s = 1, . . . , n — 1. (2-3) 



Rationality of the expressions (2.1), (2.2), and (2.3) mean that rational points of 
the pinned sphere &° are in one-to-one correspondence with rational points of the 
equatorial hyperplane a. As for the south pole S, it is usually associated with the 
infinite point y = oo on a. Indeed, passing to the limit y — > oo in the formulas 
(2.2) and (2.3), one can get the coordinates of the point S. 

3. Orthogonal matrices and ONB's. 

Let O G SO(n,Q) be some orthogonal matrix. Its columns can be treated as 
vectors in Q". Let's denote them ei, ... , e„. Orthogonality of O means that the 
transposed matrix coincides with the inverse matrix 0~^: 

0^ = 1. (3.1) 

Written in explicit form the equality (3.1) means that ei, ... , e„ are vectors of the 
unit length perpendicular to each other. They form a so called ONB (orthonormal 
basis) in Q" with respect to the standard scalar product 

n 



(X, Y) = ^a;'2/ 



1=1 



Matrices from the special orthogonal group S0(7i, Q) obey the additional restriction 



detQ = l. (3.2) 

For the basis vectors ei, ... , e„ the restriction (3.2) means that ei, ... , e„ form a 
right oriented ONB (or a right ONB). In three dimensional space Q'^ this property 
can be visualized. 

Orientation rule. Vectors ei, 62, 63 form a right triple in Q'^ if when looking 
from the end point of the third vector es the shortest rotation from the first vector 
ei to the second vector 62 is observed as a counterclockwise rotation. 

In physics (in electricity and magnetism) this rule is formulated as the rule of a 
right screw and also as the rule of a left hand (see [2] and [3]). In multidimensional 
spaces n > 3 one cannot visualize the concept of left and right since a human 
has no visual experience of living in such spaces. However, one can understand 
it mathematically by means of the theory of determinants and skew-symmetric 
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polylinear forms. Indeed, one should first prescribe right orientation to the standard 
basis in Q" composed by the vectors 





1 



















1 







El = 




, E2 = 




, . . . , E„ — 
















1 



then one should consider the transition matrix S relating the standard basis (3.3) 
with another basis ei , ... , e„ . Its components are defined by the equahty 

e,=Y,Si-Ej, i = l,...,n. (3.4) 

Definition 3.1. A basis ei, . . . , e„ in Q" is called right oriented if dct S > 0. 
Otherwise, if det S < 0, this base is called left oriented. 

Another method of defining orientation in Q" uses skew-symmetric polylinear 
forms. Remember that a polylinear n-form in Q" is a Q-numeric function with n 
vectorial arguments to = w(Xi, . . . , X„) which is linear with respect to each its ar- 
gument. A form u! is called completely skew-symmetric if its value changes the sign 
under permutation of any pair of arguments in it. It is known that a completely 
skew-symmetric n-form in Q" is determined uniquely up to a scalar factor. There- 
fore there is exactly one n-form lo normalized by the condition a;(Ei, . . . , E„) = 1. 
It is called the volume form. For the basis ei, ... , e„ we have the equality 

cj(ei,... ,e„) = det5 • w(Ei, . . . ,E„), 

where the matrix S is determined by the formula (3.4). This means that the value 
of the volume form lu can be used as a measure of orientation for bases in Q". 

Note that if a basis ei, ... , e„ is composed by the columns of an orthogonal 
matrix O, the transition matrix S in (3.4) coincides with O. This means that 
constructing an orthogonal matrix O £ SO(n,Q) is equivalent to choosing some 
right oriented ONB in Q". One vector in this base (say last vector e„) can be 
constructed by means of the stereographic projection as described in section 2 
above. This is the first step in our algorithm for constructing orthogonal matrices. 
Then we should complement it with other n — 1 vectors which should be unit vectors 
perpendicular to each other and perpendicular to the vector e„ as well. Below we 
use the Cayley transformation for this purpose. 

4. Cayley transformation. 

Let A be a skew-symmetric n x n square matrix. It is known that all eigenvalues 
of a skew-symmetric matrix are purely imaginary numbers (some o them can be 
equal to zero, but they cannot be nonzero real numbers). Therefore det(l — A) is 
nonzero. Let's consider the matrix O = (1 + A) ■ {1 — A)^^ . Matrices 1 + A and 
1 - A commute with each other, hence (1 + A) ■ (1 - A)"^ = (1 - ^)"^ • (1 + ^)- 
For this reason we write = (l + yl)-(l — ^)^^ as a fraction 



(4.1) 
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The formula (4.1) is known as the Caylcy transformation (sec [4]). If A is skew- 
symmetric, as it is in our case, then O is orthogonal matrix with detO = 1. The 
Caylcy transformation defines a mapping so(??, Q) SO(n, Q) similar to the ex- 
ponential mapping exp : so(n,M) — > SO(?i, M). But, in contrast to the exponential 
mapping, it is rational, which is worth for our purposes. The Caylcy transformation 
is an injcctive mapping. Indeed, if a matrix O is obtained by means of the formula 
(4.1), one can recover the matrix A by means of the formula 



A 



O-l 
O + l 



(4.2) 



However, one cannot apply the formula (4.2) to an arbitrary matrix O £ SO(n, Q). 
Matrices with eigenvalues A = — 1 are not suitable. This means that the Cayley 
transformation is not surjective. Below we modify the Cayley transformation and 
convert it into an algorithm able to produce each matrix O £ SO(n, Q). 

We define some special skew-symmetric matrix. Taking n — 1 rational numbers 



y 



y" 



we denote by A[y^, 



,y" 



^] the matrix of the form 



A[y\ 







-y 





,n-l 



,Tl-l 



(4.3) 



Substituting (4.3) into (4.1), we obtain the orthogonal matrix 

,.„_n 1 + A[y\. 



0[y\. 



,y 

,y' 



n-ll 



(4.4) 



By direct calculation one can find that the n-th column in the matrix (4.4) coincides 
with the unit vector e„ constructed by means of the stereographic projection in 
section 2. It's components arc given by the formulas (2.2) and (2.3). Let's denote 
by ei, ... ,e„_i other n — 1 columns of the matrix 0[y^, .. . ,y"'~^]. Completed by 
the vector e„, they form an ONB in Q". Here are the formulas for the components 
of the fc-tli vector e^, where k ^ n: 



= 1- 



-2/2/^ 




i=l 



it) 



i\2 



for s ^ k^ s ^ 



n-l 

E 



(4.5) 



We can pass to the limit F ^ oo in (2.2) and (2.3). However, we cannot pass to 



6 



R. A. SHARIPOV 



this limit in (4.5). For the infinity point y = oo wc set by definition 

1 ... 



oH= 




Theorem 4.1. Each unit vector e e 



1 

0-10 
0-1 



(4.6) 



given by its stereographic coordinates 



, y" is canonically associated with some orthogonal matrix O G S0(7i, Q). 

The formulas (2.2), (2.3), (4.5), and (4.6) give an explicit proof of the theorem 4.1. 
By construction the vector e is the n-th column in the matrix O = 0[y^ , . . . , y'^~^]. 

Now let O be some arbitrary matrix from the special orthogonal group SO(n, Q). 
We denote by e its n-th column. This is the unit vector with rational components 
x^, ... , a;". Suppose that y^, ... , y""^ are its stereographic coordinates (see for- 
mula (2.1) in section 2). They are also rational numbers. Therefore we can construct 
the orthogonal matrix 0[y^, . . . , as described above. As a result we get two 

orthogonal matrices O and 0[y^, . . . with the same n-th column in them. 

This is possible if and only if these matrices are bound by the relationship 



0^0[y\ 



,y 



(4.7) 



where 51 is an orthogonal matrix of the form (1.2). Thus we have proved a theorem. 

Theorem 4.2. Each orthogonal matrix O G SO(n,Q) can be represented as a 
product (4.7), where is a blockwise diagonal matrix determined by some element 
of the orthogonal group SO(n — 1,Q). 

The theorem 4.2 is an analog of the theorem 1.1, while the rational parameters 

Applying this theorem 



y" ^ are analogs of Euler angles ipi, ■ ■ 
recursively, for O E SO(n,Q) we get the equality 



y 



0^0[yl. 



,yi-'] 



0[yl 



(4.8) 



Theorem 4.3. Each orthogonal matrix O G SO(n,Q) can be constructed by means 
of the algorithm described above in the form of a product (4.8). 

The theorem 4.3 is an analog of the theorem 1.2. Note that the number of 
rational parameters in (4.8) is equal to n{n — l)/2. This is exactly the same 
number as in the statement of the theorem 1.2. 



5. Concluding remarks. 

I am not a specialist in algebra and I am not a specialist in number theory. 
I have encountered the problem of generating orthogonal matrices with rational 
elements in designing computer programs for testing students (see [5]). Therefore 
it's quite possible that all of the above results are not new. However, I hope that 
being gathered in one paper and formulated as a computational algorithm they 
could be useful for practical purposes. In addition, I have collected some references 
(see [6-12]) related to the subject of the present paper. 
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